On Variable Constraints in Privacy Preserving Data Mining

نویسندگان

  • Charu C. Aggarwal
  • Philip S. Yu
چکیده

In recent years, privacy preserving data mining has become an important problem because of the large amount of personal data which is tracked by many business applications. In many cases, users are unwilling to provide personal information unless the privacy of sensitive information is guaranteed. A recent framework performs privacy preserving data mining by using a condensation based approach. In this framework, the privacy of all records is treated homogeneously. It is therefore inefficient to design a system with a uniform privacy requirement over all records. We discuss a new framework for privacy preserving data mining, in which the privacy of all records is not the same, but can vary considerably. This is often the case in many real applications, in which different groups of individuals may have different privacy requirements. We discuss a condensation based approach for privacy preserving data mining in which an efficient method is discussed for constructing the condensation in a heterogeneous way. The heterogeneous condensation is capable of handling both static and dynamic data sets. We present empirical results illustrating the effectiveness of the method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tools for Privacy Preserving Distributed Data Mining

Privacy preserving mining of distributed data has numerous applications. Each application poses different constraints: What is meant by privacy, what are the desired results, how is the data distributed, what are the constraints on collaboration and cooperative computing, etc. We suggest that the solution to this is a toolkit of components that can be combined for specific privacy-preserving da...

متن کامل

Privacy-Preserving Data Mining: Development and Directions

This article first describes the privacy concerns that arise due to data mining, especially for national security applications. Then we discuss privacy-preserving data mining. In particular, we view the privacy problem as a form of inference problem and introduce the notion of privacy constraints. We also describe an approach for privacy constraint processing and discuss its relationship to pri...

متن کامل

Developments and Directions

This article first describes the privacy concerns that arise due to data mining, especially for national security applications. Then we discuss privacy-preserving data mining. In particular, we view the privacy problem as a form of inference problem and introduce the notion of privacy constraints. We also describe an approach for privacy constraint processing and discuss its relationship to pri...

متن کامل

An Efficient Cryptographic Privacy Preserving Algorithm for Association Rule Mining over Heterogeneous Database

Recently, there are many privacy and security issues in data mining. A considerable research has focused on developing new data mining algorithms that incorporate privacy constraints. Their is a conflict between privacy and data mining. As most types of data mining produce summary results that do not reveal information about individuals. But the process of data mining may use private data, lead...

متن کامل

Privacy Preserving Categorical Data Analysis with Unknown Distortion Parameters

Randomized Response techniques have been investigated in privacy preserving categorical data analysis. However, the released distortion parameters can be exploited by attackers to breach privacy. In this paper, we investigate whether data mining or statistical analysis tasks can still be conducted on randomized data when distortion parameters are not disclosed to data miners. We first examine h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005